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NOVEL RECEPTOR 
Field of the Invention 

The present invention is in the general field of biological receptors and the various uses 
that can be made of such receptors. More specifically, the invention relates to nucleic acids 
encoding novel G protein-coupled receptors and to the receptors per se. 

Background and Prior Art 

G protein-coupled receptors (GPCRs) constitute, a family of proteins sharing a common 
10 structural organization characterized by an extracellular N-terminal end, seven hydrophobic 
alpha heliees-putatively constituting transmembrane domains and an- intracellular C- 
terminal domain. GPCRs bind a wide variety of ligands that trigger intracellular signals 
through the activation of transducing G proteins (Caron, et al, Rec. Prog. Horm. Res. 
48:277-290 (1993); Freedman et al, Rec. Prog. Horm. Res. 51:319-353 (1996)). 

15 

More than 300 GPCRs have been cloned thus far and it is generally assumed that there 
exist well over 1000 such receptors. Mechanistically, approximately 50-60% of all 
clinically relevant drugs act by modulating the functions of various GPCRs (Cudermann, et 
al, J. MoL Med. 73:51-63 ( 1995)). Of particular interest are receptors located in dorsal root 
20 ganglia. This region of the central nervous system is densely innervated with primary or 
afferent sensory neurons involved in the transmission, modulation and sensation of pain. 
Thus, receptors from this region may be used in assays for the identification of new agents 
for anesthesia and analgesia 

25 Summary of the Invention 

The present invention is based upon the discovery of a novel G protein-coupled receptor 
which is distinct from previously reported receptors in terms of structure and in being 
expressed preferentially in dorsal root ganglia. One dorsal root receptor (DRR) has been 
isolated and sequenced from the rat and six from the human. The rat receptor was given the 



5 



designation rDRR-1 and its amino acid sequence is shown as SEQ ID NO: 1. The human 

receptors were designated as 

hDRR-1 (SEQ ID NO:3); 

hDRR-2 (SEQ ID NO: 5); 

hDRR-3 (SEQ ID NO:7): 

hDRR-4 (SEQ IDNO:9); 

hDRR-5 (SEQ ID NO: 11); and 

hDRR-6 (SEQ ID NO: 13). 

Unless otherwise specified, the term "DRR" as used herein refers to all of the receptors 
from both human and rat . 



In its first aspect, the invention is directed to proteins, except as existing in nature, 

comprising the amino acid sequence consisting functionally of a rat or human DRR as 
shown in SEQ ID NO:l, 3, 5, 7, 9, 11, or 13. The term "consisting functionally of is 
intended to include any receptor protein whose sequence has undergone additions, 
deletions or substitutions which do not substantially alter the functional characteristics of 
the receptor. Thus, the invention encompasses proteins having exactly the same amino 
acid sequence as shown in the sequence listing, as well as proteins with differences that are 
not substantial as evidenced by their retaining the basic, qualitative binding properties of 
the DRR receptor. The invention further encompasses substantially pure proteins consisting 
essentially of a DRR amino acid sequence, antibodies that bind specifically to a DRR (i.e. 
that have at least a 100 fold greater affinity for the DRR than any other naturally occurring 
non-DRR protein), and antibodies made by a process involving the injection of 
pharmaceutically acceptable preparations of such proteins into an animal capable of 
antibody production. In a preferred embodiment, monoclonal antibody to human or rat 
DRR is produced by injecting a pharmaceutically acceptable preparation of the receptor 
into a mouse and then fusing mouse spleen cells with myeloma cells. 

The invention is also directed to a substantially pure polynucleotide encoding a protein 
comprising the amino acid sequence consisting functionally of the sequence of rat DRR (as 
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shown in SEQ ID NO:l) or a human DRR (as shown in SEQ ID NOs 3, 5, 7, 9, 1 1 or 13). 
This aspect of the invention encompasses polynucleotides encoding proteins consisting 
essentially of the amino acid sequences shown in the sequence listing, expression vectors 
comprising such polynucleotides, and host cells transformed with such vectors. Also 
5 included are the recombinant rat and human DRR proteins produced by host cells made in 
this manner. 

Preferably, the polynucleotide encoding rat DRR has the nucleotide sequence shown in 
SEQ ID NO:2 and the polynucleotide encoding a human DRR has the nucleotide sequence 
10 shown in SEQ ID NO: 3, 5, 7, 9, 1 1 or 13. It is also preferred that the vectors and host 
cells used for the expression-ofDRR contain these particular polynucleotides. 

In another aspect, the present invention is directed to a method for assaying a test 
compound for its ability to bind to a rat or human DRR. The method is performed by 

is incubating a source of DRR with a ligand known to bind to the receptor and with the test 
compound. The source of the DRR should be substantially free of other types of G protein- 
coupled receptors, i.e. greater than 85% of such receptors present should correspond to the 
DRR. Upon completion of incubation, the ability of the test compound to bind to the DRR 
is determined by the extent to which ligand binding has been displaced. The rat DRR 

20 should, preferably correspond to rDRR-1 as shown in SEQ ID NO:l. The human receptor 
should preferably be hDRR-1 (SEQ ID NO:3); bDRR-2 (SEQ ID NO:5); hDRR-3 (SEQ ID 
NO:7); hDRR-4 (SEQ ID NO:9); hDRR-5 (SEQ ID NO:l 1); or hDRR-6 (SEQ ID NO: 13). 
Either transformed cells expressing recombinant DRR may be used in the assays or 
membranes can be prepared from the cells and used. Although not essential, the assay can 

25 be accompanied by the determination of the activation of a second messenger pathway such 
as the adenyl cyclase pathway. This should help to determine whether a compound that 
binds to DRR is acting as an agonist or antagonist. 

An alternative method for determining if a test compound is an agonist of any of the 
30 DRRs disclosed herein is to use a cell signaling assay, e.g., an assay measuring either 
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intracellular adenyl cyclase activity or intracellular calcium concentration. The test 
compound is incubated with cells expressing the DRR but substantially free of other G 
protein-coupled receptors, typically a cell transfected with an expression vector encoding 
the DRR. Test compounds that are agonists are identified by their causing a statistically 
5 significant change in the results obtained from the cell signaling assay when compared to 
control transfectants not exposed to test compound. For example, the cells exposed to the 
test compound may show a significant increase in adenyl cyclase activity or in intracellular 
calcium concentration. 

10 The invention also encompasses a method for determining if a test compound is an 
antagonist of a DRR- which relies upon the- known- acti vationof G protein^coupled 
receptors that occurs when such receptors are expressed in large amounts. This method 
requires that DNA encoding the receptor be incorporated into an expression vector so that 
it is operably linked to a promoter and that the vector then be used to transfect an 
is appropriate host. In order to produce sufficient receptor to result in constitutive receptor 
activation (i.e., activation in the absence of natural ligand), expression systems capable of 
copious protein production are preferred, e.g., the DRR DNA may be operably linked to a 
CMV promoter and expressed in COS or HEK293 cells. After transfection, cells with 
activated receptors are selected based upon their showing increased activity in a cell 
20 signaling assay relative to comparable cells that have either not been transfected or that 
have been transfected with a vector that is incapable of expressing functional DRR. 
Typically, cells will be selected either because they show a statistically significant 
increase in intracellular adenyl cyclase activity or a statistically significant increase in 
intracellular calcium concentration. The selected cells are contacted with the test 
25 compound and the cell signaling assay is repeated to determine if this results in a decrease 
in activity relative to control cells not contacted with the test compound. For example, a 
statistically significant decrease in either adenyl cyclase activity or calcium concentration 
relative to control cells would indicate that the test compound is an antagonist of the DRR. 
Any of the DRRs disclosed herein may be used in these assays. 



30 
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Assays for compounds interacting with a DRR may be performed by incubating a source 
containing the DRR but substantially free of other G protein-coupled receptors (e.g. a 
stably transformed cell) with angiotensin II or HI in both the presence and absence of test 
compound and measuring the modulation of intracellular calcium concentration. A 
significant increase or decrease in angiotensin-stimulated calcium displacement in response 
to test compound is indicative of an interaction occurring at the DRR. The receptors that 
may be used in these assays include rat DRR-1 and human DRR-1, DRR-2, DRR-3, DRR- 
4, DRR- 5 and DRR-6. 

In another aspect, the present invention is directed to a method for assaying a test 
compound-forks- ability to alter the expression of a fat brhuman DRR. This method is 
performed by growing cells expressing the DRR, but substantially free of other G protein- 
coupled receptors, in the presence of the test compound. Cells are then collected and the 
expression of the DRR is compared with expression in control cells grown under 
essentially identical conditions but in the absence of the test compound. The rat receptor is 
preferably rDRR-1 and the human receptor may be DRR-1; DRR-2; DRR-3; DRR-4; DRR- 
5; or DRR-6. 

A preferred test compound is an oligonucleotide at least 15 nucleotides in length 
20 comprising a sequence complimentary to the sequence of the DRR used in the assay. 
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Brief Description of the Drawings 

Figure 1. Nucleotide sequence of rDRR- 1 : Clone 3B-32, encoding rDRR-1, was isolated 
from a rat genomic library using the Promoter Finder Walking Kit (see Methods, 
Clontech). 

5 

The cloned gene was deposited with the international depositary authority Deutsche 
Sammlung Von Mikroorganismen Und Zellkulturen GmbH at the address Mascheroder 
Weg 1 B, D-3300 Braunschweig, Germany. The deposit was made on November 27, 1998 
and was given the accession number DSM 1 1877. 

10 

Figure 2. Deduced amino acid sequence 6TDRR- 1 : Clone 3B -32" codes" for a 337 amino 
acid protein. The amino acid sequence begins with the first ATG in the nucleotide 
sequence. 

is Figure 3. Alignment of the deduced amino acid sequences of clone 3B-32 (rDRR-1) with 
its five most homologous sequences. The boxed and shaded residues are the ones that are 
identical to the rDRR-1 sequence. 

Figure 4. Amino acid alignment of the human DRR homologs: The amino acid sequence 
20 of all 6 human homologs of rDRR-1 (hDRR-1; hDRR-2; hDRR-3; hDRR-4; hDRR-5; and 
hDRR-6 ) are aligned. The amino acid residues differing from the clone 36 
(HUMAN36.PR) are boxed. The degree of identity among these sequences ranges from 
77% to almost 100%. 

25 Definitions 

The description that follows uses a number of terms that refer to recombinant DNA 
technology. In order to provide a clear and consistent understanding of the specification 
and claims, including the scope to be given such terms, the following definitions are 
provided. 

30 
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Cloning vector: A plasmid or phage DNA or other DNA sequence which is able to 
replicate autonomously in a host cell, and which is characterized by one or a small number 
of restriction endonuclease recognition sites. A foreign DNA fragment may be spliced into 
the vector at these sites in order to bring about the replication and cloning of the fragment. 
5 The vector may contain a marker suitable for use in the identification of transformed cells. 
For example, markers may provide tetracycline resistance or ampicillin resistance. 

Expression vector: A vector similar to a cloning vector but which is capable of inducing 
the expression of the DNA that has been cloned into it, after transformation into a host. 
10 The cloned DNA is usually placed under the control of (i.e., operably linked to) certain 
regulatory sequences such as promoters of "enhancers. Promoter sequences may be 
constitutive, inducible or repressible. 

Substantially pure: As used herein, "substantially pure" means that the desired product 
is is essentially free from contaminating cellular components. A "substantially pure" protein 
or nucleic acid will typically comprise at least 85% of a sample, with greater percentages 
being preferred. Contaminants may include proteins, carbohydrates or lipids. One method 
for determining the purity of a protein or nucleic acid is by electrophoresing a preparation 
in a matrix such as polyacrylamide or agarose. Purity is evidenced by the appearance of a 
20 single band after staining. Other methods for assessing purity include chromatography and 
analytical centrifugation. 

Host: Any prokaryotic or eukaryotic cell that is the recipient of a replicable expression 
vector or cloning vector is the "host" for that vector. The term encompasses prokaryotic or 
25 eukaryotic cells that have been engineered to incorporate a desired gene on its chromosome 
or in its genome. Examples of cells that can serve as hosts are well known in the art, as are 
techniques for cellular transformation (see e.g. Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2nd ed. Cold Spring Harbor (1989)). 



30 



Promoter: A DNA sequence typically found in the 5 region of a gene, located proximal 
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to the start codon. Transcription is initiated at the promoter. If the promoter is of the 
inducible type, then the rate of transcription increases in response to an inducing agent. 

Complementary Nucleotide Sequence: A complementary nucleotide sequence, as used 
5 herein, refers to the sequence that would arise by normal base pairing. For example, the 
nucleotide sequence 5 -AGAC-3 would have the complementary sequence 5 - GTCT-3 . 

Expression: Expression is the process by which a polypeptide is produced from DNA. 
The process involves the transcription of the gene into mRNA and the translation of this 
10 mRNA into a polypeptide. 

Detailed Description of the Invention 

The present invention is directed to DRR receptor proteins, genetic sequences coding for 
15 the receptors, a method for assaying compounds for binding to DRR receptors and a 
method for assaying compounds for their ability to alter DRR expression. The receptors 
and their nucleic acids are defined by their structures (as shown in figures 1,2 and 4; and 
SEQ ID numbers 1-14). 

20 It will be understood that the present invention encompasses not only sequences identical 
to those shown in the figures and sequence listing, but also sequences that are essentially 
the same and sequences that are otherwise substantially the same and which result in a 
receptor retaining the basic binding characteristics of the DRR. For example, it is well 
known that techniques such as site-directed mutagenesis may be used to introduce 

25 variations in a protein's structure. Variations in a DRR protein introduced by this or some 
similar method are encompassed by the invention provided that the resulting receptor 
retains the basic qualitative binding characteristics of the unaltered DRR. Thus, the 
invention relates to proteins comprising amino acid sequences consisting functionally of 
the sequence of SEQ ID NO: 1 (rat) and SEQ ID numbers 3, 5, 7, 9, 1 1 and 14 (human). 



30 
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I. Nucleic Acid Sequences Coding for DRR 

DNA sequences coding for DRRs are expressed exclusively, or at least highly 
preferentially, in dorsal root ganglia and these ganglia may serve as a source for the 
isolation of nucleic acids coding for the receptors. In addition, cells and cell lines that 
express a rat or human DRR may serve as a source for nucleic acid. These may either be 
cultured cells that have not undergone transformation or cell lines specifically engineered 
to express recombinant DRR. 

In all cases, poly A+ mRNA is isolated from the dorsal root ganglia, reverse transcribed 
and cloned. The cDNA library thus formed may then be screened using probes derived 
fronrthe sequences shown in the - " iccomplunyhYg sequence hsting as~SEQlD number 2, 4, 6, 
8, 10, 12 or 14, depending upon the particular DRR being isolated. Probes should typically 
be at least 14 bases in length and should be derived from a portion of the DRR sequence 
that is poorly conserved (see Figures 3 and 4). Screening can also be performed using 
genomic libraries with one DRR gene, or a portion of the gene, serving as a probe in the 
isolation of other DRR genes. For example, full length rDRR-1 may be labeled and used to 
screen a human genomic library for the isolation of hDRR-1, hDRR-2 etc. (see Examples 
section). 

Alternatively genomic DNA libraries can be used to isolate DRR genes by performing 
PCR amplifications with primers located at either end of genes (see Examples section for a 
description of procedures). For example, human genomic DNA may be amplified using 
the primers: 



5 ' -GC AAGCTTTCTG AGC ATGG ATCC A ACCGTC, and 
5'-CCCTCAGATCTCCAATTTGCTTCCCGACAG. 
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This will serve to amplify all six of the human DRR genes identified herein as hDRR- 1 ; 
hDRR-2; hDRR-3; hDRR -4; hDRR-5; and hDRR-6. These may then be cloned into an 
appropriate vector, e.g. pGEM-T (Promega), for DNA sequence analysis. 

5 II. Antibodies to Rat and Human DRRs 

The present invention is also directed to antibodies that bind specifically to a rat or human 
DRR and to a process for producing such antibodies. Antibodies that "bind specifically to a 
DRR" are defined as those that have at least a one hundred fold greater affinity for the 
DRR than for any other protein. The process for producing such antibodies may involve 
io either injecting the DRR protein itself into an appropriate animal or, preferably, injecting 
stornwptfaes ffiaaeTo correspond to different regions of the DRR. The peptides should be 
at least five amino acids in length and should be selected from regions believed to be 
unique to the particular DRR protein being targeted. Thus, highly conserved 
transmembrane regions should generally be avoided in selecting peptides for the generation 
is of antibodies. Methods for making and detecting antibodies are well known to those of 
skill in the art as evidenced by standard reference works such as: (Harlow et al. 
Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, NY. (1 988)); Klein, 
Immunology: The Science of Self -Nonself Discrimination (1982); Kennett, et al., 
Monoclonal Antibodies and Hybridomas: A New Dimension in Biological Analyses (1980); 
20 and Campbell, "Monoclonal Antibody Technology, " in Laboratory Techniques in 
Biochemistry and Molecular Biology, (1984)). 

"Antibody," as used herein, is meant to include intact molecules as well as fragments which 
retain their ability to bind to antigen (e.g., Fab and F(ab )2 fragments). These fragments 

25 typically produced by proteolytically cleaving intact antibodies using enzymes such 
papain (to produce Fab fragments) or pepsin (to produce F(ab )2 fragments). The term 
"antibody" also refers to both monoclonal antibodies and polyclonal antibodies. Polyclonal 
antibodies are derived from the sera of animals immunized with the antigen. Monoclonal 
antibodies can be prepared using hybridoma technology (Kohler, et al., Nature 256:495 

o (1975); Hammerling, et al, in: Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, 



are 
as 
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M.Y., pp. 563-681 (1981)). In general, this technology involves immunizing an animal, 
usually a mouse, with either intact DRR or a fragment derived from the DRR. The 
splenocytes of the immunized animals are extracted and fused with suitable myeloma cells, 
e.g., SP20 cells. After fusion, the resulting hybridoma cells are selectively maintained in 
5 HAT medium and then cloned by limiting dilution (Wands, et aL, Gastroenterology 

80:225-232 (1981)). The cells obtained through such selection are then assayed to identify 
clones which secrete antibodies capable of binding to the DRR. 

The antibodies, or fragments of antibodies, of the present invention may be used to detect 
10 the presence of DRR protein using any of a variety of immunoassays. For example, the 
^ibodiesIhayTie usedln radioimnumoassays of In Ti^unon^tfic assays" also known as^ 
"two-site" or "sandwich" assays (see Chard, T, "An Introduction to Radioimmune Assay 
and Related Techniques, " in Laboratory Techniques in Biochemistry and Molecular 
Biology, North Holland Publishing Co., N.Y. (1978)). In a typical immunometric assay, a 

15 quantity of unlabeled antibody is bound to a solid support that is insoluble in the fluid 

being tested, e.g., blood, lymph, cellular extracts, etc. After the initial binding of antigen to 
immobilized antibody, a quantity of detectably labeled second antibody (which may or may 
not be the same as the first) is added to permit detection and/or quantitation of bound 
antigen (see e.g. Radioimmune Assay Method, Kirkham et al., ed., pp. 199-206, E&S. 

20 Livingstone, Edinburgh (1970)). Many variations of these types of assays are known in the 
art and may be employed for the detection of the DRR. 

Antibodies to a rat or human DRR may also be used in the purification of either the intact 
receptor or fragments of the receptor (see generally, Dean et al, Affinity Chromatography, 
25 A Practical Approach, IRL Press (1986)). Typically, antibody is immobilized on a 

chromatographic matrix such as Sepharose 4B. The matrix is then packed into a column 
and the preparation containing the DRR desired is passed through under conditions that 
promote binding, e.g., under conditions of low salt. The column is then washed and bound 
DRR is eluted using a buffer that promotes dissociation from antibody, e.g., buffer having 
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an altered pH or salt concentration. The eluted DRR may be transferred into a buffer of 
choice, e.g., by dialysis, and either stored or used directly. 

III. Radioligand Assay for Receptor Binding 

5 One of the main uses for DRR nucleic acids and recombinant proteins is in assays 

designed to identify agents capable of binding to DRR receptors. Such agents may either be 
agonists, mimicking the normal effects of receptor binding, or antagonists, inhibiting the 
normal effects of receptor binding. Of particular interest is the identification of agents 
which bind to the DRR and modulate adenyl cyclase activity in the cells. These agents have 

io potential therapeutic application as either analgesics or anesthetics. 

Iin^ioligMa bindih^Msays, a source" oTDRR isTin^ a ligand known 

to bind to the receptor and with the compound being tested for binding activity. The 
preferred source for DRR is cells, preferably mammalian cells, transformed to 
recombinantly express the receptor. The cells selected should not express a substantial 

15 amount of any other G protein-coupled receptors that might bind to ligand and distort 

results. This can easily be determined by performing binding assays on cells derived from 
the same tissue or cell line as those recombinantly expressing DRR but which have not 
undergone transformation. 

20 The assay may be performed either with intact cells or with membranes prepared from 

the cells (see e.g. Wang, etaL, Proc. Natl. Acad. Sci. U.SA. 90:10230-10234 (1993)). The 
membranes are incubated with a ligand specific for the DRR receptor and with a 
preparation of the compound being tested. After binding is complete, receptor is separated 
from the solution containing ligand and test compound, e.g. by filtration, and the amount of 

25 binding that has occurred is determined. Preferably, the ligand used is detectably labeled 
with a radioisotope such as 1251. However, if desired, fluorescent or chemiluminescent 
labels can be used instead. Among the most commonly used fluorescent labeling 
compounds are fluorescein isothiocynate, rhodamine, phycoerythrin, phycocyanin, 
allophycocyanin, o-phthaldehyde and fluorescamine. Useful chemiluminescent compounds 

30 include luminol, isoluminol, theromatic acridinium ester, imidazole, acridinium salt, and 
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oxalate ester. Any of these agents which can be used to produce a ligand suitable for use in 
the assay. 

Nonspecific binding may be determined by carrying out the binding reaction in the 
5 presence of a large excess of unlabeled ligand. For example, labeled ligand may be 
incubated with receptor and test compound in the presence of a thousandfold excess of 
unlabeled ligand. Nonspecific binding should be subtracted from total binding, i.e. binding 
in the absence of unlabeled ligand, to arrive at the specific binding for each sample tested. 
Other steps such as washing, stirring, shaking, filtering and the like may be included in the 
10 assays as necessary. Typically, wash steps are included after the separation of membrane- 
b"ound"ligand"frmirli^and remaimngin solution and prior" to quahtiMiorTof"tfie"amount of " 
ligand bound, e.g., by counting radioactive isotope. The specific binding obtained in the 
presence of test compound is compared with that obtained in the presence of labeled ligand 
alone to determine the extent to which the test compound has displaced receptor binding. 

15 

In performing binding assays, care must be taken to avoid artifacts which may make it 
appear that a test compound is interacting with the DRR receptor when, in fact, binding is 
being inhibited by some other mechanism. For example, the compound being tested should 
be in a buffer which does not itself substantially inhibit the binding of ligand to DRR and 

20 should, preferably, be tested at several different concentrations. Preparations of test 
compound should also be examined for proteolytic activity and it is desirable that 
antiproteases be included in assays. Finally, it is highly desirable that compounds identified 
as displacing the binding of ligand to DRR receptor be reexamined in a concentration range 
sufficient to perform a Scatchard analysis on the results. This type of analysis is well 

25 known in the art and can be used for determining the affinity of a test compounds for 

receptor (see e.g., Ausubel, etaL, Current Protocols in Molecular Biology, 11.2.1-11.2.19 
(1993); Laboratory Techniques and Biochemistry and Molecular Biology, Work, et al, ed., 
N.Y. (1978) etc.). Computer programs may be used to help in the analysis of results (see 
e.g., Munson, P., Methods Enzymol. 92:543-577 (1983); McPherson, G.A., Kinetic, EBDA 
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Ligand, Lowry-A Collection of Radioligand Binding Analysis Programs, Elsevier-Biosoft, 
U.K. (1985)). 

The activation of receptor by the binding of ligand may be monitored using a number of 
5 different assays. For example, adenyl cyclase assays may be performed by growing cells in 
wells of a microtiter plate and then incubating the various wells in the presence or absence 
of test compound. cAMP may then be extracted in ethanol, lyophilized and resuspended in 
assay buffer. Assay of cAMP thus recovered may be carried out using any method for 
determining cAMP concentration, e.g. the Biotrack cAMP Enzyme-immunoassay System 
10 (Amersham) or the Cyclic AMP [3H] Assay System (Amersham). Typically, adenyl 
cyclase assays will be performed separately from"bi"n"ding" assays; but" it may also be 
possible to perform binding and adenyl cyclase assays on a single preparation of cells. 
Other "cell signaling assays" that can be used to monitor receptor activity are described 
below. 

15 

IV. Identification of DRR Agonists and Antagonists Using Cell Signaling Assays 
DRRs may also be used to screen for drug candidates using cell signaling assays. To 
identify DRR agonists, the DNA encoding a receptor is incorporated into an expression 
vector and then transfected into an appropriate host. The transformed cells are then 
20 contacted with a series of test compounds and the effect of each is monitored. Among the 
assays that can be used are assays measuring cAMP production (see discussion above), 
assays measuring the activation of reporter gene activity, or assays measuring the 
modulation of the binding of GTP-gamma-S. 

25 Cell signaling assays may also be used to identify DRR antagonists. G protein-coupled 
receptors can be put in their active state even in the absence of their cognate ligand by 
expressing them at very high concentration in a heterologous system. For example, receptor 
may be overexpressed using the baculovirus infection of insect Sf9 cells or a DRR gene 
may be operably linked to a CMV promoter and expressed in COS or HEK293 cells. In this 

30 activated constitutive state, antagonists of the receptor can be identified in the absence of 
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ligand by measuring the ability of a test compound to inhibit constitutive cell signaling 
activity. Appropriate assays for this are, again, cAMP assays, reporter gene activation 
assays or assays measuring the binding of GTP-gamma-S. 

5 One preferred cell signaling assay is based upon the observation that cells stably 
transfected with DRRs show a change in intracellular calcium levels in response to 
incubation in the presence of angiotensin II or HI (see Example 5). Thus, a procedure can 
be used to identify DRR agonists or antagonists that is similar to the radioreceptor assays 
discussed above except that angiotensin H or HI is used instead of a labeled ligand and 
10 calcium concentration is measured instead of bound radioactivity. The concentration of 
calcium in the presence of test compound andwgioKnsin ITof HTis "compared with that in 
the presence of angiotensin II or in alone to determine whether the test compound is 
interacting at the DRR receptor. A statistically significant increase in intracellular calcium 
in response to test compound indicates that the test compound is acting as an agonist 
whereas a statistically significant decrease in intracellular calcium indicates that it is acting 
as an antagonist. 

V. Assay for Ability to Modulate DRR Expression 

One way to either increase or decrease the biological effects of a DRR is to alter the 
extent to which the receptor is expressed in cells. Therefore, assays for the identification of 
compounds that either inhibit or enhance expression are of considerable interest. These 
assays are carried out by growing cells expressing a DRR in the presence of a test 
compound and then comparing receptor expression in these cells with expression in cells 
grown under essentially identical conditions but in the absence of the test compound. As in 
the binding assays discussed above, it is desirable that the cells used be substantially free of 
competing G protein-coupled receptors. One way to quantitate receptor expression is to 
fuse the DRR sequence to a sequence encoding a peptide or protein that can be readily 
quantitated. For example, the DRR sequence may be ligated to a sequence encoding 
haemaglutinin as described in Example 5 and used to stably transfect cells. After 
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incubation with test compound the hemagglutininn/receptor complex can be 
immunoprecipitated and western blotted with anti- haemaglutinin antibody. 
Alternatively, Scatchard analysis of binding assays may be performed with labeled ligand 
to determine receptor number. The binding assays may be carried out as discussed above 
and will preferably utilize cells that have been engineered to recombinantly express DRR. 

A preferred group of test compounds for inclusion in the DRR expression assay consists 
of oligonucleotides complementary to various segments of the DRR nucleic acid sequence. 
These oligonucleotides should be at least 15 bases in length and should be derived from 
non-conserved regions of the receptor nucleic acid sequence. Sequences may be based 
upon those shown-as-SEQ ID numbers 2, 4, 6, 8; TO, 12 or 14. 

Oligonucleotides which are found to reduce receptor expression may be derivatized or 
conjugated in order to increase their effectiveness. For example, nucleoside 
phosphorothioates may be substituted for their natural counterparts (see Cohen, J., 
Oligodeoxynucleotides, Antisense Inhibitors of Gene Expression, CRC Press (1989)). The 
oligonucleotides may be delivered to a patient in vivo for the purpose of inhibiting DRR 
expression. When this is done, it is preferred that the oligonucleotide be administered in a 
form that enhances its uptake by cells. For example, the oligonucleotide may be delivered 
by means of a liposome or conjugated to a peptide that is ingested by cells (see e.g., U.S. 
Patent Nos. 4,897,355 and 4,394,448; see also non-U.S. patent documents WO 8903849 
and EP 0263740). Other methods for enhancing the efficiency of oligonucleotide delivery 
are well known in the art and are also compatible with the present invention. 

Having now described the invention, the same will be more readily understood through 
reference to the following Examples which are provided by way of illustration and which 
are not intended to limit the scope of the invention. 
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EXAMPLES 



Example 1 : Cloning of Rat DRR- 1 
Isolation of cDNA fragment. 



Degenerate oligonucleotides were synthesized to highly conserved regions of G-protein 
coupled receptors (transmembrane spanning domains 2 and 7) with the following 
nucleotide sequences: 



5' GG CCG TCG ACT TCA TCG TC(A/T) (A/C)(T/C)C TI(G/T) CI(T/C) TIG 
C(-A/C/G/T)G 3* (TO2ysense)-SEQ-ID NOT15; and 

5' (A/G)(C/A/r)(A/r) (A/G)CA (A/G)TA IAT IAT IGG (A/G)TT 3' 
(TM7:antisense) SEQ ID NO: 16. 

Poly A+ mRNA was isolated from cultured fetal rat dorsal root ganglia 
(Sprague-Dawley). The mRNA was reverse transcribed using the First Strand cDNA 
Synthesis kit (Pharmacia Biotech), subjected to an amplification reaction by polymerase 
chain reaction (PCR) using Ampli-Taq DNA (Perkin-Elmer Cetus) polymerase under the 
following conditions: 3 minutes at 94 °C, 40 cycles of 1 minute at 94 °C, 45°C and 72 °C. 
A cDNA PCR fragment corresponding to approximately 650 bps was isolated and 
subcloned in pGEM-T-vector (Promega Corporation). The nucleotide sequence of the 
recombinant clone was determined using the T7-dideoxy chain termination sequencing kit 
(Pharmacia Biotech) and was found to be unique based upon searches of Genbank/EMBL 
databases. 

The full length rat DRR-1 sequence was obtained from rat genomic DNA using the 650 
base pair fragment and the "Promoter Finder DNA Walking kit" (Clontech, cat # K1806-1). 
This kit contains five libraries of uncloned, adaptor-ligated genomic DNA fragments. The 
procedure involves two consecutive PCR reactions. Both reactions were done using the 



N 1807-1 SE 



18 



"Advantage Tth Polymerase Mix" also obtained from Clontech, following the conditions 
recommended by the vendor. The first PCR reaction was performed with the outer adaptor 
primer (API) provided in the kit and an outer, gene-specific primer (GSP1) derived from 
the sequence of the DRR-1 PCR fragment. The primary PCR mixtures were diluted and 
5 used as a template for the secondary (nested) PCR reaction with the nested adapter primer 
(AP2) and a nested gene specific primer (GSP2). To obtain the sequence of the rat DRR-1 
gene upstream of the sequence of the original PCR fragment, the following 
oligonucleotides were used: 

io GSP1: oligo YF3B59-B, 5 ' -CGC AGATGAGGT AGTAC AGC ATC AC SEQ ID NO:17 
GSP2- oligo MML-R 1,5'- GTGTGAGAGAGATGGTAACATACAG" SEQ ID NOTlS 

From the first library, a fragment AP2-MMLR1 of 1.9 Kb was obtained and from the 
third library, a fragment of approximately 1.0 Kb was obtained. To identify the sequence 
is downstream of the known sequence, the following primers were used: 

GSP1: oligo YF3B59-F2, 5 '-GCATCCTTGACTGGTTCTTCTCAG SEQ ID NO:19 
GSP2: oligo MML-F1, 5'- GGGTG AG ACTC ATC ATC ATTTGTGG. SEQ ID NO:20 



20 



25 



A fragment MMLF1-AP2 of approximately 1 Kb was obtained from the first library and 
a fragment of about 600 bp was obtained from the third library. The composite sequence of 
1 154 nucleotides containing the complete predicted open reading frame of DRR-1 is shown 
in Figure 1. The open reading frame codes for a 337 amino acid protein (Figure 2) with a 
predicted molecular mass of 38.7 kD. The protein sequence contains all the characteristic 
features of G protein-coupled receptors: seven hydrophobic helices likely to represent 
transmembrane domains, potential glycosylate site at the N-terminal extracellular domain 
(position 30) and a conserved NPXXY sequence at position 285-289. 
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Example 2: Clo ning of Human DRR Receptor Genes 

Two approaches were used to identify and clone novel human DNA sequences 
homologous and/or related to the rat DRR-1 gene. First, a human genomic library was 
5 screened in the lambda vector, Fix II, (Stratagene Cat.# 946203). Approximately 106 
human genomic clones were plated and transferred onto nitrocellulose membranes for 
hybridization with the full length, 32P labeled, rat DRR-1 sequence as a probe. The 
hybridization was performed at 42 °C, overnight. The filters were washed at room 
temperature at low stringency (IX SSC/ 0. 1% SDS) to allow detection of related but not 
10 necessarily identical sequences. 



The inserted human DNA present in positive phages was amplified by PCR using the 
"Expand PCR kit" from Boehringer-Mannheim under conditions allowing accurate 
amplification of very large fragments of DNA. These long fragments of DNA were 
digested with various restriction enzymes and subcloned into a plasmid vector. The 
portions of these clones which hybridized with the rat DRR-1 gene probe were sequenced 
using the ABI cycle sequencing kit. 



15 



A second approach to identifying novel human sequences related to DRR-1 involved the 
20 use of the polymerase chain reaction (PCR), performed on total human genomic DNA. 
Primers were synthesized based upon the human genomic clones described above and were 
as follows: 

HML.H, 5 ' -GC AAGCTTTCTG AGC ATGG ATCC AACCGTC, SEQ ID 21 and 
25 HML.Bg, 5 ' -CCCTC AG ATCTCC AATTTGCTTCCCGAC AG, SEQ ID NO:22. 

Amplification resulted in a fragments of approximately 1 kilobase containing the entire 
coding sequence of the human genes. These fragments obtained were subcloned into the 
pGEM-T (Promega) vector for DNA sequencing analysis. 



30 
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Using the above strategies, six human clones were isolated: 
clone 7, SEQ ID numbers 3 and 4; 
clone 18, SEQ ID numbers 5 and 6; 
clone 23, SEQ ID numbers 7 and 8; 
5 clone 24, SEQ ID numbers 9 and 10; 

clone 36, SEQ ID numbers 1 1 and 12; and 
clone 40, SEQ ID numbers 13 and 14. 

None of these clones contain introns and their alignment may be seen in Figure 3. 

10 

At the amino~ acid sequence level, the rat DRR- 1 clone is 47%to 49%~identical to the 
human clones. 

At the nucleic acid level, the rat DRR-1 clone is 56% to 58% identical to the human 
is clones .The level of sequence identity within the human clones (7, 18, 23, 24, 36, 40) is 
very high, between 77% and 98% at the amino acid sequence level. All the human 
sequences were used as queries to search for homologies in public databases (Genbank, 
Swissprot, EST). No identical sequences were detected. The closest matches were to 
members of the mas oncogene family of proteins. The overall amino acid sequence 
20 homology between rat DRR-1 and any of the isolated human genes varied from 47 to 50 %. 
However some stretches display a much higher level of sequence homology, particularly 
the regions encoding the putative transmembrane domain III and VII (TM3 and TM7) and 
the intracellular loops 2 and 3 where the homology between the rat sequence and its human 
homologue is around 80%. 

25 

Example 3: In Situ Hybridization Experiments 

Preparation of Tissue: Adult male Sprague-Dawley rats (-300 gm; Charles River, 
St-Constant, Quebec) were sacrificed by decapitation. Brain and spinal cord with dorsal 
root ganglia attached were removed, snap-frozen in isopentane at -40°C for 20 s and stored 
30 at -80 °C. Frozen human brain, spinal cord and dorsal root ganglia were obtained from the 
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Brain and Tissue Bank for Developmental Disorders, University of Maryland at Baltimore, 
according to the strictest ethical guidelines. Frozen tissue was sectioned at 14 m in a 
Microm HM 500 M cryostat (Germany) and thaw-mounted onto ProbeOn Plus slides 
(Fisher Scientific, Montreal, Quebec). Sections were stored at -80°C prior to in situ 
hybridization. 

Synthesis of Riboprobes: The plasmid pGemT-3b32 GPCR was linearized using either 
SacII and Not 1 restriction enzymes. Sense and antisense DRR riboprobes were transcribed 
in vitro using either T7 or SP6 RNA polymerases (Pharmacia Biotech), respectively in the 
presence of [35S]UTP (-800 Ci/mmol; Amersham, Oakville, Ontario). The plasmid 
pGemT^eione-36 GPCR: was^lmearized using SacII andTsTl "restncSdn enzymes. Sense 
and antisense Clon36 riboprobes were transcribed in vitro using either SP6 or T7 RNA 
polymerases (Pharmacia Biotech), respectively in the presence of [35S]UTP. Following 
transcription, the DNA template was digested with DNAse I (Pharmacia). Riboprobes were 
purified by phenol/chloroform/isoamyl alcohol extraction and precipitated in 70% ethanol 
containing ammonium acetate and tRNA. Quality of labeled riboprobes was verified by 
polyacrylamide-urea gel electrophoresis. 

In situ Hybridization: Sections were postfixed in 4% paraformaldehyde (BDH, Poole, 
England) in 0.1 M phosphate buffer (pH 7.4) for 10 min at room temperature (RT) and 
rinsed in three changes of 2X standard sodium citrate buffer (SSC; 0.15 M NaCl. 0.015 M 
sodium citrate, pH 7.0). Sections were then equilibrated in 0. 1 M triethanolamine, treated 
with 0.25% acetic anhydride in triethanolamine, rinsed in 2X SSC and dehydrated in an 
ethanol series (50-100%). Hybridization was performed in a buffer containing 75% 
formamide (Sigma, St-Louis, Mo), 600 mM NaCl, 10 mM Tris (pH 7.5), 1 mM EDTA, IX 
Denhardt's solution (Sigma), 50 (g/ml denatured salmon sperm DNA (Sigma), 50 (g/ml 
yeast tRNA (Sigma), 10% dextran sulfate (Sigma), 20 mM dithiothreitol and [35S]UTP- 
labeled cRNA probes (10 X 106 cpm/ml) at 55°C for 18 h in humidified chambers. 
Following hybridization, slides were rinsed in 2X SSC at RT, treated with 20 (g/ml RNase 
IA (Pharmacia) in RNase buffer (10 mM Tris, 500 mM NaCl, 1 mM EDTA, pH 7.5) for 45 
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min at RT and washed to a final stringency of 0.1X SSC at 65 °C. Sections were then 
dehydrated and exposed to Kodak Biomax MR film for 21 days and/or dipped in Kodak 
NTB2 emulsion diluted 1:1 with distilled water and exposed for 4-6 weeks at 4°C prior to 
development and counterstaining with cresyl violet acetate (Sigma). 

5 

Results: Of all regions examined within the neuraxis of the rat, DRR-1 mRNA was 
exclusively expressed in dorsal root ganglia. High resolution emulsion autoradiography 
showed accumulations of silver grains exclusively over small and some medium size 
neurons. This unique and highly restricted distribution pattern for DRR-1 was confirmed in 
10 the rat embryo. Sagittal section of an E17 rat fetus showed that DRR-1 mRNA is confined 
to DRGs. AirotKef structures of The "rat embryo were devoid ofany specifichybndization 
signal reinforcing the highly selective nature of DRR-1 expression 

The expression of human Clone 36 receptor was present in human fetal dorsal root 
15 ganglia but not in spinal cord. Specific hybridization signal for Clone 36 was not detected 
in any of the human adult CNS tissues examined thus far. These include spinal cord, 
cortex, hippocampus, thalamus, substantia nigra and periaqueductal gray (data not shown). 
Presence of Clone 36 mRNA in adult DRGs remains to be examined. Standard controls in 
which additional spinal cord with DRG sections were hybridized with rat DRR-1 antisense 
20 or Clone 36 sense 35S-labeled probes displayed no specific hybridization signal. 

Example 4: Northern Blots 

Commercial rat and human multiple Northern blots containing 2 g of polyA RNA from 
various tissues (Clontech) were used to determine the expression and distribution of the rat 

25 DRR-1 message and its human homologues. Radioactively labeled probes were prepared as 
follows: twenty five ng of a 650 bp 3b-32 PCR fragment derived from rat DRR-1 (see 
Example 1) or human clone 36 were random-prime labeled using the Ready-to-Go DNA 
labeling kit (Pharmacia Biotech) and [32P]CTP (3000 Ci/mmol/Amersham). The blot was 
prehybridized for 1 hour at 68 °C using Expresshyb (Clontech) followed by hybridization 

30 (2X106 cpm/ml of probe) for one hour using the same conditions. Blots were washed at 



N 1807-1 SE 



23 



room temperature in 2X SSC, 0.05% SDS for 30 min. followed by 3x washes in 0.2X 
SSC, 1 % SDS at 50 °C for 60 min. and exposed at -80 °C to Kodak Biomax film for 6 
days. 

5 Expression and Distribution of rat DRR-1: All the rat tissues studied (heart, brain, 

spleen, lung, skeletal muscle, kidney and testis) were negative for the expression of DRR-1 
following 2 weeks exposure whereas rat genomic Southern analysis revealed a 1.1 kb band 
when probed with the same cDNA fragment. 

o Expression and Distribution of Human Clone 36: Northern blots containing RNA from 
- various-human-tissues weT^prob-ed-wif IT a radio-labeled DNA^ragmenFfromcIone 36. All 
the human tissues studied (human fetal brain, lung, liver and kidney and adult human 
cerebellum, cerebral cortex, medulla, occipital pole, frontal lobe, temporal lobe, putamen, 
spinal cord, amygdala, caudate nucleus, corpus caUosum, hippocampus, total brain, 

; subthalamic nucleus and thalamus) were negative for the expression of this receptor 
following 2 weeks exposure. 



Example 5: Calciu m Signaling in Response to Angiotensin I-HI 
The coding sequence of human clone 24 was transferred into a pcDNA3 vector and 
modified to add a haemaglutinin tag at the C-terminus of the receptor sequence. This 
clone, designated as pcDNA3-HML-HA24 was transfected into HEK293 cells using a 
modified CaCl 2 method (Maniatis, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory Press (1989)). The cells were maintained in culture medium at 
37 °C, 5% C0 2 and diluted 10 fold every 3 days. 

The cells were inoculated in 90 mm tissue culture dishes (5 x 105 cells per flask) in 
Dulbecco's Modified Essential Medium (DMEM, Gibco BRL), supplemented with 10% 
fetal bovine serum (FBS), 100 U/ml penicillin, 100 pg/ml streptomycin and 0.25 u.g/ml 
fungizone. One day after inoculation, cells were transiently transfected with 30 pig of 
plasmid DNA per dish. The cells were harvested 48 hours post transfection for analysis. 
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The expression of the gene was first checked by immunoprecipitation and western blotting 
with an anti-haemaglutinin antibody. A protein of approximately 43 KD was detected in 
stably as well as transiently transfected HEK293 cells, but not in control cells. 

5 Stably transfected HEK293 cells were obtained after approximately 21 days of selection 
in culture medium containing 800 |ig/ml G418. Calcium signaling measurement was 
performed with one of these stably transfected cell line, 293/pcDNA3-HML-HA24. The 
cells were grown on a 24 mm round glass cover slides to 50-70% confluence. After rinsing 
the cells with 1.8 NBS buffer (135 mM NaCl, 5 mM KC1, 1.2mMMgCl 2 , 1.8 mM 

10 CaCl2, 5 mM glucose and 10 mM HEPES, pH 7.3), the cells were incubated for one hour 
at room-temperaturrtn~the presence of 0:5" ml of 3.5" fXM FURA~-2 AM (Molecular Probe, 
F-1221) diluted in 1.8 NBS. The cells were then rinsed three times with 1.8 NBS and 
incubated for a further 30 minutes at room temperature. The calcium displacement was 
measured using a PTI (Photon Technology International) D104 photometer equipped with: a 

15 PTI Delta RAM High speed multiwavelength illuminator, a PTI SC500 Shutter controller, 
a PTI LPS220 ARC lamp supply and the PTI FELIX software, v. 1.2. Groups of 2 to 8 cells 
were chosen and isolated with the photometer diaphragm. The cells were exposed to 340 
and 380 nm light and the 510 nm light emitted by the cells was recorded. Angiotensin I, II 
and EI, were added successively - in various order from one experiment to the next - 

20 followed by bradykinin as a positive control. Upon stimulation with angiotensin II and 
angiotensin m, a significant response was obtained (figure 4). Addition of angiotensin I 
produced no response. 

All references cited herein are fully incorporated by reference. Having now fully 
25 described the invention, it will be understood by one of skill in the art that the invention 
may be performed within a wide and equivalent range of conditions, parameters and the 
like, without affecting the spirit or scope of the invention or any embodiment thereof. 
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SEQUENCE LISTING 
SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Astra Pharma Inc. Canada 
(ii) TITLE OF INVENTION: Novel receptor 
(Hi) NUMBER OF SEQUENCES": 22 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Astra AB, Patent Department 

(B) STREET: S-151 85 Sodertalje 

(C) CITY: Sodertalje 

(D) STATE: 

(E) COUNTRY: Sweden 

(F) ZIP: none 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1. 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 46-8 553 26000 

(B) TELEFAX: 46-8 553 28820 

) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Val Cys Val Leu Arg Asp Thr Thr Gly Arg Phe Val Ser Met Asp 
15 10 15 

Pro Thr lie Ser Ser Leu Ser Thr Glu Ser Thr Thr Leu Asn Lys Thr 
20 25 * 30 

Gly His Pro Ser Cys Arg Pro lie Leu Thr Leu Ser Phe Leu Val Pro 



35 



40 



45 
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He He Thr Leu Leu Gly Leu Ala Gly Asn Thr lie Val Leu Trp Leu 
50 55 60 

Leu Gly Phe Arg Met Arg Arg Lys Ala He Ser Val Tyr Val Leu Asn 
65 70 75 80 

Leu Ser Leu Ala Asp Ser Phe Phe Leu Cys Cys His Phe lie Asp Ser 
85 90 95 



Leu Met Arg He Met Asn Phe Tyr Gly He Tyr Ala His Lys Leu Ser 
100 105 110 

Lys Glu He Leu Gly Asn Val Ala Phe He Pro Tyr He Ser Gly Leu 
115 120 125 

Ser He Leu Ser Ala He Ser Thr Glu Arg Cys Leu Ser Val Leu Trp 
130 135 140 

Pro He Trp Tyr His Cys His Arg Pro Arg Asn Met Ser Ala He He 
145 150 155 160 

Cys Val Leu He Trp Val Leu Ser Phe Leu Met Gly He Leu Asp Trp 
165 170 175 

Phe Phe Ser Gly Phe Leu Gly Glu Thr His His His Leu Trp Lys Asn 
180 185 190 



Val Asp Phe He Val Thr Ala Phe Leu He Phe Leu Phe Met Leu Leu 
195 200 205 
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Phe Gly Ser Ser Leu Ala Leu Leu Val Arg lie Leu Cys Gly Ser Arg 
210 215 220 

Arg Lys Pro Leu Ser Arg Leu Tyr Val Thr lie Ser Leu Thr Val Met 
225 230 235 240 

Val Tyr Leu lie Cys Gly Leu Pro Leu Gly Leu Tyr Leu Phe Leu Leu 
245 250 255 

Tyr Tfp Phe Gly lie His Leu His" Tyr Pro Phe Cys~His~Ile Tyr Gin 
260 265 270 

Val Thr Val Leu Leu Ser Cys Val Asn Ser Ser Ala Asn Pro lie lie 
275 280 285 

Tyr Phe Leu Val Gly Ser Phe Arg His Arg Lys Lys His Arg Ser Leu 
290 295 300 

Lys Met Val Leu Lys Arg Ala Leu Glu Glu Thr Pro Glu Glu Asp Glu 
305 310 315 320 

Tyr Thr Asp Ser His Val Gin Lys Pro Thr Glu lie Ser Glu Arg Arg 
325 330 335 

Cys 



INFORMATION FOR SEQ ID NO : 2 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

10 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
15 ATGGTTTGTG TTCTCAGGGA CACTACTGGA AGATTTGTGA GCATGGATCC AACCATCTCA 60 
TCC CTCAGT A CAGAATCTAC AACACTGAAT AAAAC TGGTC ATCCCAGTTG CAGGCCAATC 120 
CTCACCCTGT CCTTCCTGGT CCCCATCATC ACCCTGCTTG GATTGGCAGG AAACACCATT 180 

20 

GTACTCTGGC TCTTGGGATT CCGCATGCGC AGGAAAGC C A TCTCAGTCTA CGTCCTCAAC 240 
CTGTCTCTGG CAGACTCCTT CTTCCTCTGC TGCCATTTTA TTGACTCTCT GATGCGGATC 300 
25 ATGAACTTCT ATGGCATCTA TGCC C AT AAA TTAAGCAAAG AAATCTTAGG CAATGTAGCA 3 60 

TTCATTCCCT ATATCTCAGG CCTGAGCATC CTCAGTGCTA TCAGCACGGA GCGCTGCCTG 42 0 

TCTGTATTGT GGCCAATCTG GTACCACTGC CACCGCCCAA GAAACATGTC AGCTATTATA 480 

30 
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TGTGTTCTAA TCTGGGTTCT GTCCTTTCTC ATGGGCATCC TTGACTGGTT TTTCTCAGGA 540 

TTCCTGGGTG AGACTCACCA TCATTTGTGG AAAAATGTTG ACTTTATTGT AACTGCATTT 600 

5 CTGATTTTTT TATTTATGCT TCTCTTTGGG TCCAGTCTGG CGCTACTGGT GAGGATCCTC 660 

TGTGGTTCCA GACGGAAACC ACTGTCCAGG CTGTACGTTA CAATCTCTCT CACAGTGATG 72 0 

GTCTACCTCA TCTGCGGCCT GCCTCTCGGG CTTTACTTGT TC CTGC TATA TTGGTTTGGG 7 80 

10 

ATCCATTTAC ATTATCCCTT TTGTC ACATT TACCAAGTTA CTGTGCTCCT GTC CTGTGTG ' 840 

AACAGCTCTG CCAACCCCAT CATTTACTTC CTTGTAGGGT CCTTTAGGCA CCGTAAAAAG 900 

15 CATCGGTCCC TCAAAATGGT TCTTAAAAGG GCTCTGGAGG AGACTC CTGA GGAGGATGAA 960 

TATACAGACA GCCATGTTCA GAAACCCACT GAGATCTCAG AAAGGAGATG T 1011 

20 ( 2 ) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

25 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



30 (iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Asp Pro Thr lie Pro Val Leu Gly Thr Lys Leu Thr Pro He Asn 
15 10 15 

Gly Arg Glu Glu Thr Pro Cys Tyr Asn Gin Thr Leu Ser Phe Thr Gly 
20 25 30 



Leu Thr Cys He He Ser Leu Val Ala Leu Thr Gly Asn Ala Val Val 
35 40 45 

Leu Trp Leu Leu Gly Cys Arg Met Arg Arg Asn Ala Val Ser lie Tyr 
50 55 60 

He Leu Asn Leu Val Ala Ala Asn Phe Leu Phe Leu Ser Gly His He 
65 70 75 80 

He Phe Ser Pro Leu Pro Leu He Asn He Arg His Pro He Ser Lys 
85 90 95 

He Leu Ser Pro Val Met Thr Phe Pro Tyr Phe He Gly Leu Ser Met 
100 105 HO 

Leu Ser Ala He Ser Thr Glu Arg Cys Leu Ser He Leu Trp Pro He 
115 120 125 

Trp Tyr His Cys Arg Arg Pro Arg Tyr Leu Ser Ser Val Met Cys Val 



32 



130 135 140 

Leu Leu Trp Ala Leu Ser Leu Leu Arg Ser lie Leu Glu Trp Met Phe 
145 150 155 160 

Cys Asp Phe Leu Phe Ser Gly Ala Asn Ser Val Trp Cys Glu Thr Ser 
165 170 175 

Asp Phe lie Thr lie Ala Trp Leu Val Phe Leu Cys Val Val Leu Cys 
180 185 190 



Gly Ser Ser Leu Val Leu Leu Val Arg lie Leu Cys Gly Ser Arg Lys 
195 200 205 

Met Pro Leu Thr Arg Leu Tyr Val Thr lie Leu Leu Thr Val Leu Val 
210 215 220 

Phe Leu Leu Cys Gly Leu Pro Phe Gly lie Gin Trp Ala Leu Phe Ser 
225 230 235 240 

Arg lie His Leu Asp Trp Lys Val Leu Phe Cys His Val His Leu Val 
245 250 255 

Ser lie Phe Leu Ser Ala Leu Asn Ser Ser Ala Asn Pro lie lie Tyr 
260 265 270 



Phe Phe Val Gly Ser Phe Arg Gin Arg Gin Asn Arg Gin Asn Leu Lys 
275 280 285 
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Leu Val Leu Gin Arg Ala Leu Gin Asp Thr Pro Glu Val Asp Glu Gly 
290 295 300 

Gly Gly Trp Leu Pro Gin Glu Thr Leu Glu Leu Ser Gly Ser Lys Leu 
305 310 315 320 

Glu Gin 

(2|~ INFORMATION FOR " ~SBQ ID ~NO :~4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ATGGATCCAA CCATCCCAGT C TTGGGT AC A AAACTGACAC CAATCAACGG ACGTGAGGAG 
ACTCCTTGCT ACAACCAAAC CCTGAGCTTC ACGGGGCTGA CGTGCATCAT TTCCCTTGTC 
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GCGCTGACAG GAAACGCGGT TGTGCTCTGG 
GTCTCCATCT ACATCCTCAA CCTGGTCGCG 
5 ATATTTTCGC CGTTACCCCT CATCAATATC 
GTGATGACCT TTCCCTACTT TATAGGCCTA 
TGCCTGTCCA TCCTGTGGCC CATC TGGT AC 

10 

GTCATGTGTG TCCTGCTCTG GGCCCTGTCC 

TGTGACTTCC TGTTTAGTGG TGCTAATTCT 

15 ATCGCGTGGC TGGTTTTTTT ATGTGTGGTT 

AGGATTCTCT GTGGATCCCG GAAGATGCCG 

ACAGTGCTGG TCTTCCTCCT CTGTGGCCTG 
20 

AGGATCCACC TGGATTGGAA AGTCTTATTT 
TCCGCTCTTA ACAGCAGTGC CAACCCCATC 
25 CGTCAAAATA GGCAAAACCT GAAGCTGGTT 
GTGGATGAAG GTGGAGGGTG GCTTCCTCAG 
GAGCAGTGA 

30 
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CTCCTGGGCT GCCGCATGCG CAGGAACGCT 180 



GCCAACTTCC TCTTCCTTAG CGGCCACATT 240 



CGCCATCCCA TCTCCAAAAT CCTCAGTCCT 3 00 



AGCATGCTGA GCGCCATCAG CACCGAGCGC 360 



CACTGCCGCC GCCCCAGATA CCTGTCATCG 42 0 



CTGCTGCGGA GTATCCTGGA GTGGATGTTC 480 



GTTTGGTGTG AAACGTCAGA TTTCATTACA 540 



CTCTGTGGGT CCAGCCTGGT CCTGCTGGTC 600 



CTGACCAGGC TGTACGTGAC CATCCTCCTC 660 



CCCTTTGGCA TTCAGTGGGC CCTGTTTTC C 72 0 



TGTCATGTGC ATCTAGTTTC CATTTTCCTG 7 80 



ATTTACTTCT TCGTGGGCTC CTTTAGGCAG 840 



CTCCAAAGGG CTCTGCAGGA CACGCCTGAG 900 



GAAACCCTGG AGCTGTCGGG AAGCAAATTG 960 



969 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
( i i i ) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Asp Pro Thr Val Pro Val Leu Gly Thr Glu Leu Thr Pro He Asn 
15 10 15 

Gly Arg Glu Glu Thr Pro Cys Tyr Lys Gin Thr Leu Ser Phe Thr Gly 
20 25 30 

Leu Thr Cys He Val Ser Leu Val Ala Leu Thr Gly Asn Ala Val Val 
35 40 45 

Leu Trp Leu Leu Gly Cys Arg Met Arg Arg Asn Ala Val Ser He Tyr 
50 55 eo 

He Leu Asn Leu Val Ala Ala Asp Phe Leu Phe Leu Ser Gly His He 
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65 70 75 



80 



lie Cys Ser Pro Leu Arg Leu lie Asn He Ser His Pro He Ser Lys 
85 90 95 

He Leu Ser Pro Val Met Thr Phe Pro Tyr Phe He Gly Leu Ser Met 
100 105 no 

Leu Asn Ala He Ser Thr Glu Arg Cys Leu Ser He Leu Trp Pro He 
H5 120 125 

Trp Tyr His Cys Arg Arg Pro Arg Tyr Leu Ser Ser Val Met Cys Val 
130 135 140 

Leu Leu Trp Ala Pro Ser Leu Leu Arg Ser He Leu Glu Trp Met Phe 
145 150 155 160 

Cys Asp Phe Leu Phe Ser Gly Ala Asp Ser Val Arg Cys Glu Thr Ser 
165 170 175 

Asp Phe He Thr He Ala Trp Leu Val Phe Leu Arg Val Val Leu Cys 
180 185 190 

Gly Ser Ser Leu Val Leu Leu Val Arg He Leu Cys Gly Ser Arg Lys 
195 200 205 

Met Pro Leu Thr Arg Leu Tyr Val Thr He Leu Leu Thr Val Leu Val 
210 215 220 



Phe Leu Leu Cys Gly Leu Pro Phe Gly He Gin Trp Ala Leu Phe Ser 
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225 230 



235 240 



Arg He His Leu Asp Trp Lys Val Leu Phe Cys His Val His Leu Val 
245 250 255 



Ser He Phe Leu Ser Ala Leu Asn Ser Ser Ala Asn Pro He He Tyr 
260 265 270 

Phe Phe Met Gly Ser Phe Arg Gin Leu Gin Asn Arg Lys Thr Leu Lys 
275 230 285 

Leu Val Leu Gin Arg Asp Leu Gin Asp Thr Pro Glu Val Asp Glu Gly 
2^0 295 300 

Gly Trp Trp Leu Pro Gin Glu Thr Leu Glu Leu Ser Gly Ser Lys Leu 
305 310 315 320 

Glu He 



INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 69 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNE SS : double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



ATGGATCCAA CCGTCCCAGT CTTGGGTACA GAACTGACAC CAATCAACGG ACGTGAGGAG 



ACTCCTTGCT ACAAGCAGAC CCTGAGCTTC ACGGGGCTGA CGTGCATCGT TTCCCTTGTC 



GCGCTGACAG GAAACGCGGT TGTGCTCTGG CTCCTGGGCT GCCGCATGCG CAGGAACGCT 



GTCTCCATCT ACATCCTCAA CCTGGTCGCG GCCGACTTCC TCTTCCTTAG CGGCCACATT 



ATATGTTCGC CGTTACGCCT CATCAATATC AGCCATCCCA TCTCCAAAAT CCTCAGTCCT 



GTGATGACCT TTCCCTACTT TATAGGCCTA AGCATGCTGA ACGC CATC AG CACCGAGCGC 



TGCCTGTCCA TCCTGTGGCC CATCTGGTAC CACTGCCGCC GCCCCAGATA CCTGTCATCG 



GTCATGTGTG TCCTGCTCTG GGCCCCGTCC CTGCTGCGGA GTATCCTGGA GTGGATGTTC 



TGTGACTTCC TGTTTAGTGG TGCTGATTCT GTTCGGTGTG AAACGTCAGA TTTCATTACA 



ATCGCGTGGC TGGTTTTTTT ACGTGTGGTT CTCTGTGGGT CCAGCCTGGT CCTGCTGGTC 



AGGATTCTCT GTGGATCCCG GAAGATGCCG CTGACCAGGC TGTACGTGAC CATCCTCCTC 



ACAGTGCTGG TCTTCCTCCT CTGTGGCCTG CCCTTTGGCA TTCAGTGGGC CCTGTTTTCC 
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AGGATCCACC TGGATTGGAA AGTCTTATTT TGTCATGTGC ATCTAGTTTC CATTTTCCTG 
TCCGCTCTTA ACAGCAGTGC CAACCCCATC ATTTACTTCT TCATGGGCTC CTTTAGGCAG 
CTTCAAAACA GGAAGACCCT CAAGCTGGTT CTCCAGAGGG ATCTGCAGGA CACGCCTGAG 
GTGGATGAAG GTGGATGGTG GCTTCCTCAG GAAACCCTGG AGCTGTCGGG AAGCAAATTG 
GAGATCTGA 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met Asp Pro Thr Val Ser Thr Leu Asp Thr Glu Leu Thr Pro lie Asn 
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Gly Thr Glu Glu Thr Leu Cys Tyr Lys Gin Thr Leu Ser Leu Thr Val 
20 25 30 



Leu Thr Cys lie Val Ser Leu Val Gly Leu Thr Gly Asn Ala Val Val 
35 40 45 



Leu Trp Leu Leu Gly Cys Arg Met Arg Arg Asn Ala Phe Ser He Tyr 
50 55 60 



I-le- Leu- Asrr Leu- Ala " Ala" Ala" Asp 
65 70 

He Tyr Ser Leu Leu Ser Phe He 
85 

He Leu Tyr Pro Val Met Met Phe 
100 

Leu Ser Ala Val Ser Thr Asp Arg 
115 120 



Phe Leu Phe Eeu Ser" Gly Arg Leu 
75 80 

Ser He Pro His Thr He Ser Lys 
90 95 

Ser Tyr Phe Ala Gly Leu Asn Phe 
105 no 

Cys Leu Ser Val Leu Trp Pro He 
125 



Trp Tyr Arg Cys His Arg Pro Thr His Leu Ser Ala Val Val Cys Val 
130 135 140 



Leu Leu Trp Ala Leu Ser Leu Leu Arg Ser He Leu Glu Trp Met Leu 
145 150 155 160 



Cys Gly Phe Leu Phe Ser Gly Ala Asp Ser Ala Trp Cys Gin Thr Ser 
165 170 175 
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Asp Phe lie Thr Val Ala Trp Leu lie Phe Leu Cys Val Val Leu Cys 
180 185 190 

Gly Ser Ser Leu Val Leu Leu lie Arg lie Leu Cys Gly Ser Arg Lys 
195 200 205 

He Pro Leu Thr Arg Leu Tyr Val Thr He Leu Leu Thr Val Leu Val 
210 215 220 

Phe Leu Leu Cys Gly Leu Pro Phe Gly He Gin Phe Phe Leu Phe Leu 
225 230 235 240 

Trp He His Val Asp Arg Glu Val Leu Phe Cys His Val His Leu Val 
245 250 255 

Ser He Phe Leu Ser Ala Leu Asn Ser Ser Ala Asn Pro He He Tyr 
260 265 270 

Phe Phe Val Gly Ser Leu Arg Gin Arg Gin Asn Arg Gin Asn Leu Lys 
275 280 285 

Leu Val Leu Gin Arg Ala Leu Gin Asp Thr Pro Glu Val Asp Glu Gly 
290 295 300 

Gly Gly Trp Leu Pro Gin Glu Thr Leu Glu Leu Ser Gly Ser Arg Leu 
305 310 315 320 



Glu Gin 



N 1807-1 SE 
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(2) INFORMATION FOR SEQ ID NO : 8 : 

5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 969 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEDNESS: double 

(D) TOPOLOGY: linear 

10 

Cxi") - "MOEECUEET TYPE : DNA (^em>fhic~) 
<iii) HYPOTHETICAL: NO 
15 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
ATGGATCCAA CCGTCTCAAC CTTGGACACA GAACTGACAC CAATCAACGG AACTGAGGAG 

20 

ACTC TTTGC T ACAAGCAGAC CTTGAGCCTC ACGGTGCTGA CGTGCATCGT TTCCCTTGTC 
GGGCTGACAG GAAACGCAGT TGTACTCTGG CTCCTGGGCT GCCGCATGCG C AGGAACGC C 
25 TTCTCCATCT ACATCCTCAA CTTGGCCGCA GCAGACTTCC TCTTCCTCAG CGGCCGCCTT 
ATATATTCCC TGTTAAGCTT CATCAGTATC CCCCATACCA TCTCTAAAAT CCTCTATCCT 



180 



240 



GTGATGATGT TTTCCTACTT TGCAGGCCTG AACTTTCTGA GTGCCGTGAG CACCGATCGC 

30 



360 
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TGCCTGTCCG TCCTGTGGCC CATCTGGTAC CGCTGCCACC GCCCCACACA CCTGTCAGCG 42 0 

GTGGTGTGTG TCCTGCTCTG GGCCCTGTCC CTGCTGCGGA GCATCCTGGA ATGGATGTTA 480 

5 TGTGGCTTCC TGTTCAGTGG TGCTGATTCT GCTTGGTGTC AAACATCAGA TTTCATCACA 540 

GTCGCGTGGC TGATTTTTTT ATGTGTGGTT CTCTGTGGGT CCAGCCTGGT CCTGCTGATC 600 

AGGATTCTCT GTGGATCCCG GAAGATACCG CTGACCAGGC TGTACGTGAC CATCCTGCTC 660 

10 

ACAGTACTGG TCTTCCTCCT CTGTGGCCTG C^CTTTGGCA TTCAGTTTTT CCTATTTTTA 720 

TGGATCCACG TGGACAGGGA AGTCTTATTT TGTCATGTGC ATCTAGTTTC CATTTTCCTG 780 

15 TCCGCTCTTA ACAGCAGTGC CAACCCCATC ATTTACTTCT TCGTGGGCTC CCTTAGGCAG 840 

CGTCAAAATA GGCAGAACCT GAAGCTGGTT CTCCAGAGGG CTCTGCAGGA CACGCCTGAG 900 

GTGGATGAAG GTGGAGGGTG GCTTCCTCAG GAAAC CCTGG AGCTGTCGGG AAGCAGATTG 960 

20 

GAGCAGTGA 969 



(2) INFORMATION FOR SEQ ID NO: 9: 

25 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
30 (D) TOPOLOGY: not relevant 
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(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met Asp Pro Thr Val Ser Thr Leu Asp Thr Glu Leu Thr Pro lie Asn 
1 5 10 15 

Gly Thr Glu Glu Thr Leu Cys Tyr Lys Gin Thr Leu Ser Leu Thr Val 
20 25 30 

Leu Thr Cys lie Val Ser Leu Val Gly Leu Thr Gly Asn Ala Val Val 
35 40 45 

Leu Trp Leu Leu Gly Cys Arg Met Arg Arg Asn Ala Phe Ser He Tyr 
50 55 60 

He Leu Asn Leu Ala Ala Ala Asp Phe Leu Phe Leu Ser Gly Arg Leu 
65 70 75 80 

lie Tyr Ser Leu Leu Ser Phe He Ser lie Pro His Thr He Ser Lys 
85 90 95 

He Leu Tyr Pro Val Met Met Phe Ser Tyr Phe Ala Gly Leu Ser Phe 



100 



105 



110 
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Leu Ser Ala Val Ser Thr Glu Arg Cys Leu Ser Val Leu Trp Pro lie 
115 120 125 

Trp Tyr Arg Cys His Arg Pro Thr His Leu Ser Ala Val Val Cys Val 
130 135 . 140 

Leu Leu Trp Ala Leu Ser Leu Leu Arg Ser lie Leu Glu Trp Met Leu 
145 150 155 160 

Cys Gly Phe Leu Phe Ser Gly Ala Asp Ser Ala Trp Cys Gin Thr Ser 
TG5 T70 17 5 

Asp Phe lie Thr Val Ala Trp Leu lie Phe Leu Cys Val Val Leu Cys 
180 185 190 

Gly Ser Ser Leu Val Leu Leu lie Arg lie Leu Cys Gly Ser Arg Lys 
195 200 205 

He Pro Leu Thr Arg Leu Tyr Val Thr He Leu Leu Thr Val Leu Val 
210 215 220 

Phe Leu Leu Cys Gly Leu Pro Phe Gly He Gin Phe Phe Leu Phe Leu 
225 230 235 240 

Trp He His Val Asp Arg Glu Val Leu Phe Cys His Val His Leu Val 
245 250 255 

Ser He Phe Leu Ser Ala Leu Asn Ser Ser Ala Asn Pro He He Tyr 
260 265 270 



SE 



Phe Phe Val Gly Ser Phe Arg Gin 
275 280 

Leu Val Leu Gin Arg Ala Leu Gin 
290 295 

Gly Gly Gin Leu Pro Gin Glu Thr 
305 310 

Glu Gin 
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Arg Gin Asn Arg Gin Asn Leu Lys 
285 



Asp Ala Ser Glu Val Asp Glu Gly 
300 



Leu Glu Leu Ser Gly Ser Arg Leu 
315 320 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 969 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATGGATCCAA CGGTCTCAAC CTTGGACACA GAATTGACAC CAATCAACGG AACTGAGGAG 
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ACTCTTTGCT AC AAGC AG AC CTTGAGCCTC 
GGGCTGACAG GAAACGCGGT TGTGCTCTGG 
5 TTCTCCATCT ACATCCTCAA CTTGGCCGCA 
ATATATTCCC TGTTAAGCTT CATCAGTATC 
GTGATGATGT TTTCCTACTT TGCAGGCCTG 

10 

TC^CTGTCCG TCCTGTGGCC CATCTGGTAC 

GTGGTGTGTG TCCTGCTCTG GGCCCTGTCC 

15 TGTGGCTTCC TGTTCAGTGG TGCTGATTCT 

GTCGCGTGGC TGATTTTTTT ATGTGTGGTT 

AGGATTCTCT GTGGATCCCG GAAG AT AC CG 
20 

AC AGT AC TGG TCTTCCTCCT CTGTGGCCTG 

TGGATCCACG TGGACAGGGA AGTCTTATTT 

25 TCCGCTCTTA ACAGCAGTGC CAACCCCATC 

CGTCAAAATA GGCAGAACCT GAAGCTGGTT 

GTGGATGAAG GTGGAGGGCA GCTTCCTGAG 
30 
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ACGGTGCTGA CGTGCATCGT TTCCCTTGTC 12 0 



CTCCTGGGCT GCCGCATGCG CAGGAACGCC 180 



GCAGACTTCC TCTTCCTCAG CGGCCGCCTT 240 



CCCCATACCA TCTCTAAAAT CCTCTATCCT 3 00 



AGCTTTCTGA GTGCCGTGAG CACCGAGCGC 3 60 



CGCTGCCACC GCCCCACACA CCTGTCAGCG 42 0 



CTGCTGCGGA GCATCCTGGA GTGGATGTTA 480 



GCTTGGTGTC AAACATCAGA TTTCATCACA 540 



CTCTGTGGGT CCAGCCTGGT CCTGC TGATC 600 



CTGACCAGGC TGTACGTGAC CATCCTGCTC 660 



CCCTTTGGCA TTCAGTTTTT CCTATTTTTA 720 



TGTCATGTTC ATCTAGTTTC TATTTTCCTG 780 



ATTTACTTCT TCGTGGGCTC CTTTAGGCAG 840 



CTCCAGAGGG C TCTGCAGG A CGCGTCTGAG 900 



GAAATCCTGG AGCTGTCGGG AAGCAGATTG 960 
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GAGCAGTGA 969 



(2) INFORMATION FOR SEQ ID NO: 11: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
10 (D) TOPOLOGY: not relevant 



15 



20 
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(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Asp Pro Thr Val Pro Val Leu Gly Thr Lys Leu Thr Pro lie Asn 
15 10 is 

Gly Arg Glu Glu Thr Pro Cys Tyr Lys Gin Thr Leu Ser Phe Thr Val 
20 25 30 

Leu Thr Cys He He Ser Leu Val Gly Leu Thr Gly Asn Ala Val Val 
35 40 45 



30 



Leu Trp Leu Leu Gly Cys Arg Met Arg Arg Asn Ala Val Ser He Tyr 
50 55 60 
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He Leu Asn Leu Ala Ala Ala Asp Phe Leu Phe Leu Ser Phe Gin He 
65 70 75 80 

He Cys Arg Pro Leu Arg Leu He Asn He Ser His Leu He Arg Lys 
85 90 95 

He Leu Val Ser Val Met Thr Phe Pro Tyr Phe Thr Gly Leu Ser Met 
100 105 no 

Leu Ser Ala He Ser Thr- Glu Arg- Cys Leu Ser Val Leu Trp- Pro He 
115 120 125 

Trp Tyr Arg Cys Arg Arg Pro Thr His Leu Ser Ala Val Val Cys Val 
130 135 140 

Leu Leu Trp Ala Gly Leu Leu Leu Phe Ser Met Leu Glu Trp Arg Phe 
145 150 155 160 

Cys Asp Phe Leu Phe Ser Gly Ala Asp Ser Ser Trp Cys Glu Thr Ser 
165 170 175 

Asp Phe He Pro Val Ala Trp Leu He Phe Leu Cys Val Val Leu Cys 
180 185 190 

Val Ser Ser Leu Val Leu Leu Val Arg He Leu Cys Gly Ser Arg Lys 
195 200 205 



Met Pro Leu Thr Arg Leu Tyr Val Thr He Leu Leu Thr Val Leu Val 
210 215 220 
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Phe Leu Leu Cys Gly Leu Pro Phe Gly lie Leu Gly Ala Leu He Tyr 
225 230 235 240 

Arg Met His Leu Asn Leu Glu Val Leu Tyr Cys His Val Tyr Leu Val 
245 250 255 

Cys Met Ser Leu Ser Ser Leu Asn Ser Ser Ala Asn Pro He He Tyr 
260 265 270 

Phe Phe- Vad- Gly Ser Phe Arg- Gin Arg" "Gin" Asn" Arg Gin Asn Leu Lys" 
275 280 285 

Leu Val Leu Gin Arg Ala Leu Gin Asp Lys Pro Glu Val Asp Lys Gly 
290 295 300 

Glu Gly Gin Leu Pro Glu Glu Ser Leu Glu Leu Ser Gly Arg Arg Leu 
305 310 315 320 

Gly Pro 



INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



N 1807-1 SE 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
5 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ATGGATCCAA CCGTCCCAGT CTTGGGTACA AAACTGACAC CAATCAACGG ACGTGAGGAG 

10 



60 



ACTCCT.TGCT ACAAGCAGAC CCTGAGGTTG ACGGTGCTGA CGTGCATCAT TTCCCTTGTC 120 

GGACTGACAG GAAACGCGGT TGTGCTCTGG CTCCTGGGCT GCCGCATGCG CAGGAACGCT 180 

15 GTCTCCATCT ACATCCTCAA CCTGGCCGCA GCAGACTTCC TCTTCCTCAG CTTCCAAATT 240 

ATACGTTCGC CATTACGCCT CATCAATATC AGCCATCTCA TCCGCAAAAT CCTCGTTTCT 3 00 

GTGATGACCT TTCCCTACTT TACAGGCCTG AGTATGCTGA GCGCCATCAG CACCGAGCGC 3 60 

20 

TGCCTGTCTG TTCTGTGGCC CATCTGGTAC CGCTGCCGCC GCCCCACACA CCTGTCAGCG 420 

GTCGTGTGTG TCCTGCTCTG GGGCCTGTCC CTGCTGTTTA GTATGCTGGA GTGGAGGTTC 480 

25 TGTGACTTCC TGTTTAGTGG TGCTGATTCT AGTTGGTGTG AAACGTCAGA TTTCATCCCA 540 

GTCGCGTGGC TGATTTTTTT ATGTGTGGTT CTCTGTGTTT CCAGCCTGGT CCTGCTGGTC 600 

AGGATCCTCT GTGGATCCCG GAAGATGCCG CTGACCAGGC TGTATGTGAC CATCCTGCTC 660 

30 
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AC AGTGC TGG TCTTCCTCCT CTGCGGCCTG CCCTTCGGCA TTCTGGGGGC CCTAATTTAC 72 0 

AGGATGCACC TGAATTTGGA AGTCTTATAT TGTCATGTTT ATCTGGTTTG CATGTCCCTG 780 

5 TCCTCTCTAA ACAGTAGTGC CAACCCCATC ATTTACTTCT TCGTGGGCTC CTTTAGGCAG 840 

CGTCAAAATA GGCAGAACCT GAAGC TGGTT CTCCAGAGGG CTCTGCAGGA CAAGCCTGAG 900 

GTGGATAAAG GTGAAGGGCA GCTTCCTGAG GAAAGC C TGG AGCTGTCGGG AAGGAGATTG 960 



10 

GGGC C ATGA- 



(2) INFORMATION FOR SEQ ID NO: 13: 

15 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
20 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



25 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



Met Asp Pro Thr Val Pro Val Phe Gly Thr Lys Leu Thr Pro lie Asn 



959 
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15 10 15 

Gly Arg Glu Glu Thr Pro Cys Tyr Asn Gin Thr Leu Ser Phe Thr Val 
20 25 30 

Leu Thr Cys lie lie Ser Leu Val Gly Leu Thr Gly Asn Ala Val Val 
35 40 45 

Leu Trp Leu Leu Gly Tyr Arg Met Arg Arg Asn Ala Val Ser lie Tyr 
50 55 60 

lie Leu Asn Leu Ala Ala Ala Asp Phe Leu Phe Leu Ser Phe Gin lie 
65 70 75 80 

He Arg Ser Pro Leu Arg Leu He Asn He Ser His Leu He Arg Lys 
85 90 95 

He Leu Val Ser Val Met Thr Phe Pro Tyr Phe Thr Gly Leu Ser Met 
100 105 no 

Leu Ser Ala He Ser Thr Glu Arg Cys Leu Ser Val Leu Trp Pro He 
115 120 125 

Trp Tyr Arg Cys Arg Arg Pro Thr His Leu Ser Ala Val Val Cys Val 
130 135 140 

Leu Leu Trp Gly Leu Ser Leu Leu Phe Ser Met Leu Glu Trp Arg Phe 
145 150 155 160 



Cys Asp Phe Leu Phe Ser Gly Ala Asp Ser Ser Trp Cys Glu Thr Ser 
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165 170 175 

Asp Phe He Pro Val Val Trp Leu He Phe Leu Cys Val Val Leu Cys 
180 185 190 

5 

Val Ser Ser Leu Val Leu Leu Val Arg He Leu Cys Gly Ser Arg Lys 
195 200 205 

Met Pro Leu Thr Arg Leu Tyr Val Thr He Leu Leu Thr Val Leu Val 
"> 210 215 220 

Phe Leu Leu Cys Gly Leu Pro Phe Gly He Leu Gly Ala Leu He Tyr 
225 230 235 240 



15 



20 



Arg Met His Leu Asn Leu Glu Val Leu Tyr Cys His Val Tyr Leu Val 
245 250 255 

Cys Met Ser Leu Ser Ser Leu Asn Ser Ser Ala Asn Pro He He Tyr 
260 265 270 

Phe Phe Val Gly Ser Phe Arg Gin Arg Gin Asn Arg Gin Asn Leu Lys 
275 280 285 



Leu Val Leu Gin Arg Ala Leu Gin Asp Lys Pro Glu Val Asp Lys Gly 
25 290 295 300 

Glu Gly Gin Leu Pro Glu Glu Ser Leu Glu Leu Ser Gly Ser Lys Leu 
305 310 315 320 

30 Gly Pro 
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(2) INFORMATION FOR SEQ ID NO: 14: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

10 

lii) MOLECULE" TYPE ^ DNA (genomic) 
(iii) HYPOTHETICAL: NO 
15 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATGGATCCAA CCGTCCCAGT CTTCGGTACA AAACTGACAC CAATCAACGG ACGTGAGGAG 60 

20 

ACTCCTTGCT ACAATCAGAC CCTGAGCTTC ACGGTGCTGA CGTGCATCAT TTCCCTTGTC 120 
GGAC TG AC AG GAAACGCGGT TGTGCTC TGG CTC C TGGGCT ACCGCATGCG CAGGAACGCT 180 
25 GTCTCCATCT ACATCCTCAA CCTGGCCGCA GCAGACTTCC TCTTCCTCAG CTTCCAAATT 240 
ATACGTTCGC CATTACGCCT CATCAATATC AGC C ATCTC A TCCGCAAAAT CCTCGTTTCT 300 
GTGATGACCT TTCCCTACTT TACAGGCCTG AGTATGCTGA GCGCCATCAG CACCGAGCGC 3 60 
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TGCCTGTCTG TTCTGTGGCC CATCTGGTAC CGCTGCCGCC GCCCCACACA CCTGTCAGCG 42 0 

GTCGTGTGTG TCCTGCTCTG GGGCCTGTCC CTGCTGTTTA GTATGCTGGA GTGGAGGTTC 480 

5 TGTGACTTCC TGTTTAGTGG TGCTGATTCT AGTTGGTGTG AAACGTCAGA TTTCATCCCA 540 

GTCGTGTGGC TGATTTTTTT ATGTGTGGTT CTCTGTGTTT CCAGCCTGGT CCTGCTGGTC 600 

AGGATCCTCT GTGGATCCCG GAAGATGCCG CTGACCAGGC TGTACGTGAC CATCCTGCTC 660 

10 

ACAGTGCTGG TCTTCCTCCT" CTGCGGCCTG" CCCTTCGGC AT TTCTGGGGGC ~ CCTAATTTAC 72"0" 

AGGATGCACC TGAATTTGGA AGTCTTATAT TGTCATGTTT ATCTGGTTTG CATGTCCCTG 780 

15 TCCTCTCTAA ACAGTAGTGC CAACCCCATC ATTTACTTCT TCGTGGGCTC CTTTAGGCAG 840 

CGTCAAAATA GGCAGAACCT GAAGCTGGTT CTCCAAAGGG CTCTGCAGGA CAAGCCTGAG 900 

GTGGATAAAG GTGAAGGGCA GCTTCCTGAG GAAAGCCTGG AGCTGTCGGG AAGCAAATTG 960 



20 

GGGCCATGA 



(2) INFORMATION FOR SEQ ID NO: 15: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = 'synthetic PCR primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGCCGTCGAC-TTCATCGTCW-MYCTIKCIYT IGCNG~ " 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : S ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic PCR primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
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RHWRCARTAI AT I AT 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic PCR primer* 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CGCAGATGAG GTAGTACAGC ATCAC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic PCR primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTGTGAGAGA" GATGGTAACA" TACAG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic PCR primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
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GCATCCTTGA CTGGTTCTTC TCAG 



(2) INFORMATION FOR SEQ ID NO: 20: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic PCR primer" 

15 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
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GGGTG AG AC T CATCATCATT TGTGG 25 



(2) INFORMATION FOR SEQ ID NO: 21: 

25 

— . (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic PCR primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCAAGCTTTC TGAGCATGGA TCCAACCGTC 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic PCR primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
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CCCTCAGATC TCCAATTTGC TTCCCGACAG 30 



5 
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CLAIMS 

1. A protein, except as existing in nature, comprising the amino acid sequence consisting 
functionally of rat dorsal root receptor 1 (DRR-1) as shown in SEQ ID NO:l. 

2. A substantially pure polynucleotide encoding a protein comprising the amino acid 
sequence consisting functionally of rat DRR-1 as shown in SEQ ED NO:l. 

3. The polynucleotide of claim 2, wherein said polynucleotide encodes a protein 

consisting essentially of the amino acid sequence of SEQ ID NO: 1. 

4 ; The polynucleotide of claim 3, wherein said polynucleotide has the nucleotide 
sequence of SEQ ID NO:2. 

5. A protein, except as existing in nature, comprising the amino acid sequence consisting 
functionally of human dorsal root receptor 1 (DRR-1) as shown in SEQ ID NO:3. 

6. A substantially pure polynucleotide encoding a protein comprising the amino acid 
sequence consisting functionally of human DRR-1 as shown in SEQ ID NO:3. 

7. The polynucleotide of claim 6, wherein said polynucleotide encodes a protein 
consisting essentially of the amino acid sequence of SEQ ID NO:3. 

8. The polynucleotide of claim 7, wherein said polynucleotide has the nucleotide 
sequence of SEQ ID NO:4. 

9. A protein, except as existing in nature, comprising the amino acid sequence consisting 
functionally of human dorsal root receptor 2 (DRR-2) as shown in SEQ ID NO:5. 

10. A substantially pure polynucleotide encoding a protein comprising the amino acid 
sequence consisting functionally of human DRR-2 as shown in SEQ ID NO:5. 
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1 1 . The polynucleotide of claim 10, wherein said polynucleotide encodes a protein 
consisting essentially of the amino acid sequence of SEQ ID NO:5. 

12. The polynucleotide of claim 11, wherein said polynucleotide has the nucleotide 
sequence of SEQ ID NO:6. 

13. A protein, except as existing in nature, comprising the amino acid sequence consisting 
functionally of human dorsal root receptor 3 (DRR-3) as shown in SEQ ID NO:7. 

14. A substantially pure -polynucleotide encoding a- protein comprisingthe- amino acid- 
sequence consisting functionally of human DRR-3 as shown in SEQ ID NO:7. " 

15. The polynucleotide of claim 14, wherein said polynucleotide encodes a protein 
consisting essentially of the amino acid sequence of SEQ ID NO: 7. 

16. The polynucleotide of claim 15, wherein said polynucleotide has the nucleotide 
sequence of SEQ ID NO :8. 

17. A protein, except as existing in nature, comprising the amino acid sequence consisting 
functionally of human dorsal root receptor 4 (DRR-4) as shown in SEQ ID NO:9. 

18. A substantially pure polynucleotide encoding a protein comprising the amino acid 
sequence consisting functionally of human DRR-4 as shown in SEQ ID NO:9. 

19. The polynucleotide of claim 18, wherein said polynucleotide encodes a protein 
consisting essentially of the amino acid sequence of SEQ ID NO:9. 

20. The polynucleotide of claim 19, wherein said polynucleotide has the nucleotide 
sequence of SEQ ID NO: 10. 
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21. A protein, except as existing in nature, comprising the amino acid sequence consisting 
functionally of human dorsal root receptor 5 (DRR-5) as shown in SEQ ID NO: 1 1 . 

22. A substantially pure polynucleotide encoding a protein comprising the amino acid 
sequence consisting functionally of human DRR-5 as shown in SEQ ID NO: 11. 

23. The polynucleotide of claim 22, wherein said polynucleotide encodes a protein 
consisting essentially of the amino acid sequence of SEQ ID NO: 1 1 . 

24- The^polynucleotideof claim 23, wherein said polynucleotide rhas thenucleotide 
sequence of SEQ ID NO: 12. 

25. A protein, except as existing in nature, comprising the amino acid sequence consisting 
functionally of human dorsal root receptor 6 (DRR-6) as shown in SEQ ID NO: 13. 

26. A substantially pure polynucleotide encoding a protein comprising the amino acid 
sequence consisting functionally of human DRR-6 as shown in SEQ ID NO: 13. 

27. The polynucleotide of claim 26, wherein said polynucleotide encodes a protein 
consisting essentially of the amino acid sequence of SEQ ID NO: 13. 

28. The polynucleotide of claim 27, wherein said polynucleotide has the nucleotide 
sequence of SEQ ID NO: 14. 

29. An antibody made by a process comprising the step of injecting a pharmaceutically 
acceptable preparation comprising the protein of anyone of claims 1, 5, 9, 13, 17, 21, 
or 25, into an animal capable of producing said antibody.. 
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30. A vector for expressing rat DRR-1, comprising the polynucleotide of either of claim 2 
or 3. 

31. A vector for expressing human DRR-1, DRR-2, DRR-3, DRR-4, DRR-5 or DRR-6, 
5 comprising the polynucleotide of either of claims 6, 7, 10, 11, 14, 15, 18, 19, 22, 23, 

26, or 27. 

32. A host cell transformed with a vector according to claim 31. 

10 33. A method for assaying a test compound for its ability to bind or to activate a G 
protein-coupled dorsal root-ganglia-specific receptor (DRR), comprising: 

a) incubating a source containing DRR but substantially free of other G protein- 
coupled receptors, with; 
15 i) a ligand known to bind to DRR; 

ii) said test compound; and 

b) determining the extent to which said ligand binding is displaced by said test 
compound. 

20 
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Abstract 

The present invention is directed to novel G protein-coupled receptors that are found 
predominantly in the dorsal root ganglia. The invention encompasses both the receptor 
proteins as well as nucleic acids encoding the proteins. In addition, the present invention is 
5 directed to methods and compositions which utilize the receptors. 
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